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Abstract 

When approximating the joint distribution of the component counts of a de- 
composable combinatorial structure that is 'almost' in the logarithmic class, but 
nonetheless has irregular structure, it is useful to be able first to establish that the 
distribution of a certain sum of non-negative integer valued random variables is 
smooth. This distribution is not like the normal, and individual summands can con- 
tribute a non-trivial amount to the whole, so its smoothness is somewhat surprising. 
In this paper, we consider two coupling approaches to establishing the smoothness, 
and contrast the results that are obtained. 
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1 Introduction 

Many of the classical random decomposable combinatorial structures, such as random 
permutations and random polynomials over a finite field, have component structure sat- 
isfying a conditioning relation: if C\ n) denotes the number of components of size i, the 
distribution of the vector of component counts (C[ n) , . . . , C^ n) ) of a structure of size n can 
be expressed as 

J?(C{*\ . . . , CT) = J?(Z X , . . . , Z n I T 0>n = n) , (1.1) 

where (Zj, i > 1) is a fixed sequence of independent non-negative integer valued random 
variables, and T aj „ := X}IL a +i^> < a < n. If, as in the examples above, the Zj also 
satisfy 

iP[Zi = l] ->■ 6 and 0, := iEZ { ->■ 0, (1.2) 

the combinatorial structure is called logarithmic. It is shown in Arratia, Barbour & 
Tavare (2003) [ABT] that combinatorial structures satisfying the conditioning relation 
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and slight strengthenings of the logarithmic condition share many common properties. 
For instance, if is the size of the largest component, then 

n~ l L^ -+ A L, (1.3) 

where L has probability density function fg(x) := e 7 T{9 + l)x e ~~ 2 pg((l — x)/x), x G 
(0,1], and pg is the density of the Dickman distribution Pq with parameter 9, given in 
Vervaat (1972, p. 90). Furthermore, for any sequence (a n , n > 1) with a n = o(n), 

lim d TV (jSf(Cf>, . . . , C^),^(Z U ...,Z an ))=0. (1.4) 

n— >-oo 

Both of these convergence results can be complemented by estimates of the approximation 
error, under appropriate conditions. 

If the logarithmic condition is not satisfied, as in certain of the additive arithmetic 
semigroups introduced in Knopfmacher (1979), the results in [ABT] are not directly ap- 
plicable. However, in Manstavicius (2009) and in Barbour & Nietlispach (2010) [BN], 
it is shown that the logarithmic condition can be relaxed to a certain extent, without 
disturbing the validity of (ll.4p . and that (II. 3p can also be recovered, if the convergence 
in (II ,2p is replaced by a weaker form of convergence. A key step in the proofs of these 
results is to be able to show that, for sequences a n = o(n), the normalized sum n~ 1 T an n 
converges both in distribution and locally to the Dickman distribution Pg, and that the 
error rates in these approximations can be controlled. To do so, it is in turn necessary to 
be able to show that, under suitable conditions, 

lim d TV (^(T an<n ),^(T an , n + 1)) = , for all a n = o(n), (1.5) 

and that the error rate can be bounded by a power of {(a n + l)/n}. In this note, we 
explore ways of using coupling to prove such estimates, in the simplest case in which 
the Zi have Poisson distributions. The first of these, an improvement over the Mineka 
coupling, was introduced in [BN]. It is extremely flexible in obtaining error rates bounded 
by a power of {(a n + l)/n} for a wide variety of choices of the means 9i, and it is in no 
way restricted to Poisson distributed Z{S. Here, we show that, despite its attractions, 
it does not achieve the best possible error rate under ideal circumstances. The second 
approach works only in much more restricted situations, but is then capable of attaining 
the theoretically best results. 

In the case of Poisson distributed Zi, the distribution of T an is a particular compound 
Poisson distribution, with parameters determined by n and by the 9i, and it is tempting to 
try to approximate the distribution of n~ 1 T an n by first approximating by the distribution 
that would be obtained if 9i = 9 for alH. A natural way of obtaining compound Poisson 
approximation is then to use Stein's method (Barbour, Chen & Loh 1992). Difficulties 
arise, however, because the conditions of their Theorem 5 (needed to get useful bounds 
on the solution to the Stein equation) are not satisfied unless a = 0, and, even then, the 
bounds obtained are not as useful as they might be; better information for this particular 
case can be found in [ABT, Chapter 9]. And, even using this approach, it still seems 
necessary first to bound the error in (jl.5p . in order to obtain useful results. 
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2 A Mineka-like coupling 



Let {Xj} ieN be mutually independent Z-valued random variables, and let S n := Ym=i -^i- 
The Mineka coupling, developed independently by Mineka (1973) and Rosier (1977) (see 
also Lindvall (2002, Section 11.14)) yields a bound of the form 

d TV (j?(S n ),J?(S n + l)) < (|^ =1 ^)" 1/2 , (2.1) 

where 

«i := (l-drv^XO.JSf^ + l))); 

see Mattner & Roos (2007, Corollary 1.6). The proof is based on coupling copies {Xl} i£N 
and {X"} ieN of {Xj} igN in such a way that 

n 

V n := ^(Xi-XC), nen, 
i=i 

is a symmetric random walk with steps in { — 1, 0, 1}. Writing := 1 + J2]=i -^'j ~ <Si + 1 
and S'{ := Y! j=1 X'j ~ S h so that Vi + 1 = S[- S", the coupling inequality (Lindvall 1992, 
Section 1.2) then shows that 

d TV (J?(S n ),J?(S n + l)) < F[r>n] = P[K = {-1,0}], 

where r is the time at which {V n } ne i + first hits level —1, and the last equality follows from 
the reflection principle. However, this inequality gives slow convergence rates, if X« = iZi 
and the Zj are as described in the Introduction; typically, d w (jzf (iZj), J??(iZj + 1)) is 
equal to 1, and, if is taken instead to be (2i — l)Z 2 j_i + 2iZ 2 i, we still expect to have 
1 — <i TV (jz?(Xj),Jzf(Xj + 1)) x i -1 , leading to bounds of the form 

d TV (j?(T an , n ),J?(T an , n + 1)) = 0((log(n/{a n + 1}))- 1/2 ) • (2.2) 

The reason that the Mineka coupling does not work efficiently in our setting is that, 
once the random walk V n takes some value k, it has to achieve a preponderance of k + 1 
negative steps, in order to get to the state — 1, and this typically requires many jumps to 
realize. Since, at the i-th step, the probability of there being a jump is of order it thus 
takes a very long time for such an event to occur, and the probability of this not happening 
before time n is then relatively large. In [BN], the difficulty is overcome by observing that 
the Mineka random walk can be replaced by another Markov chain (V n ,n > 1), still 
constructed from copies (Z-, % > 1) and (Z", % > 1) of the original sequence, but now 
associated differently with one another. The basic idea is to note that, if Vi = k, then the 
random variables X[ +l := jZ] + (j + k + l)Z' j+k+1 and X? +1 := jZ'J + (j + k + l)Z'f +k+1 
can be coupled in such a way that X' i+1 — X" +1 G { — (k + 1), 0, (k + 1)}, for any j such 
that the indices j and j + k + 1 have not previously been used in the construction. Hence 
a single jump has probability 1/2 of making V reach —1. The construction starts as for 
the Mineka walk, but if the first jump takes V to +1, then the chain switches to jumps 
in {—2, 0, 2}; and subsequently, if V is in the state k = 2 r — 1, the chain makes jumps in 
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{— 2 r , 0, 2 r }. Clearly, this construction can be used with Zi ~ Po(i~ 1 9i), even when many 
of the 9i are zero. A number of settings of this kind are explored in detail in [BN]; for 
instance, when 9^ > 9* for all % in {rZ + + t} U {sZ + + u}, where r and s are coprime. Very 
roughly, provided that a non- vanishing fraction of the 9i exceed some fixed value 9* > 0, 
the probability that V reaches —1 before time n is of order n~ a , for some a > 0, an error 
probability exponentially smaller than that in (12.21) . 

Here, we make the following observation. Suppose that we have the ideal situation in 
which 9i = 9* > for every i. Then the probability that a coupling, constructed as above, 
should fail is at least of magnitude n~ e ' 'I 2 . In Section [21 it is shown that the total variation 
distance in (II. 5p is actually of order n - mm i e *M under these circumstances, so that the 
estimates of this distance obtained by the [BN] coupling are typically rather weaker. It is 
thus of interest to find ways of attaining sharper results. The coupling given in Section [3] 
is one such, but it is much less widely applicable. 

The coupling approach given in [BN] evolves by choosing a pair of indices Mn < M i2 
at each step i, with the choice depending on the values previously used: no index can be 
used more than once, and Mi 2 — Mn = Vi-i + 1, so that one jump in the right direction 
leads immediately to a successful coupling. Then, if (MjijM^) = {j,j + k + 1), the pair 
X[ and X" is constructed as above, by way of copies of the random variables jZj and 
(j + k + l)Zj + k+x- The probability of a jump taking place is then roughly 29* /(j + k + 1), 
and, if a jump occurs, it has probability 1/2 of taking the value — (k+1), leading to success. 
The main result of this section is the following lower bound for the failure probability of 
such a procedure. 

Theorem 2.1. For any coupling constructed as above, the probability F[F] that the cou- 
pling is not successful is bounded below by 

\n/2\ 

P[F] > JJ(l-|min{r/z,l/e}) x n~ 9 ' /2 . 
i=i 

Proof. In order to prove the lower bound, we couple two processes, one of which makes 
more jumps than the other. We start by letting (Ui, i > 1) be independent uniform 
random variables on [0, 1]. The first process is much as discussed above. It is defined by 
a sequence of pairs of indices Ma < Mj2, 1 < i < /*, from [n] := {ieN: i < n}, with 
I* < [n/2\ the last index for which a suitable pair can be found. No index is ever used 
twice, and the choice of (Mj^M^) is allowed to depend on ((Mji, Mj 2 ,Uj), 1 < j < i). 
We set Yi = I[U l < p(M a , M a )], where 

p{m 1 ,m 2 ) := 2e~ 9 * /mi (9* /m 2 ) e - e * /ni2 < 1/e, 

for mi < m>2, representing the indicator of a jump of ±(m2 — mi) being made by the first 
process at time i. For the second, we inductively define Ri := {p(l), . . . , p{i)} by taking 
Ro = and 

p(i) := max{r G [n/2] \ R^: 2r < M i2 }; 

we shall check at the end of the proof that p(i) always exists. (The second process, that 
we do not really need in detail, uses the pair {2p{i) — 1, 2p{i)) at stage i.) We then define 
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Zi := I[Ui < mm{6*/p(i), 1/e}], noting that p(M a ,M i2 ) < min{0*/p(i), 1/e}, entailing 
> a.s. for all z. Finally, let (J i; z > 1) be distributed as Be(l/2), independently of 

each other and everything else. 

The event that the first process makes no successful jumps can be described as the 

event 

F := Ij^Y^ 
[ i=i 

We thus clearly have 

r K2j 

where, for I* < i < n/2, we take p(i) := min{r G [n/2] \ Ri-i}, and := B4-1 U {/>(«)}■ 
But now the Zi, suitably reordered, are just independent Bernoulli random variables with 
means min{#*/r, 1/e}, 1 < r < n/2, and hence 

[n/2] 

P[F] > JJ(l-|min{r/z,l/e}) x n~ r/2 . 
i=i 

It remains to show that the p(i) are well defined at each stage, which requires that 

Si := {r e [n/2] \ : 2r < M i2 } ^ 0, 

1 < % < I*. For z = 1, m\2 > 2, so the start is successful. Now, for 2 < z < n/2, suppose 
that 

r(i — 1) := max{s: D {1, 2, . . . , s}}. 

Then 1,2, ...,r(i — 1) can be expressed as p(ii), p(i,2), ■ ■ ■ , p(i r (i-i)), for some indices 
z'i,Z2, • • • , zV(i-i)- For these indices, we have < 2r(z — 1) + 1, 1 < I < r{i — 1), since 
r(i — 1) + 1 ^ -Rj-i and, from the definition of /?(•), we could thus not choose p(z'z) < r(z — 1) 
if Mj i 2 > 2r(z — 1) + 2. Hence, also, M iu \ < 2r(i — 1) + 1, and, because all the M is are 
distinct, {M iltS , 1 < s < 2, 1 < I < r(i - 1)} is a set of 2r(z - 1) elements of [2r(z - 1) + 1]. 
Thus, when choosing the pair (Ma, M i2 ), there is only at most one element of [2r(z — 1) + 1] 
still available for choice, from which it follows that Mj2 > 2r(z — 1) + 2: so r(i — 1) + 1 G Si, 
and hence Si is not empty. □ 




3 A Poisson-based coupling 

In this section, we show that a coupling can be constructed that gives good error rates 
in ([LSD when Zj ~ Po(j-^*), for some fixed 6* > 0. If Zj ~ PotjT 1 ^-) with 9j > 6*, the 
same order of error can immediately be deduced (though it may no longer be optimal), 
since, for Poisson random variables, we can write T an = T* n + T", with T* n constructed 
from independent random variables Z* ~ Po(j -1 ^*), and with T' independent of T* n . 
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Because of the Poisson assumption, the distribution of T an := YTj= a +ijZj can equiv- 
alently be re-expressed as that of a sum of a random number N ~ Po(8*h an ) of indepen- 
dent copies of a random variable X having ¥[X = j] = l/{jh an }, a + 1 < j < n, where 
h an ■= Sj=a+ii _1 - Fix c > 1, define j r := [c r \, and set 

r := r (a) := [log c (a + l)~|, r\ : = n(n) := L lo g c n J- 

Define independent random variables (X ri , r < r < r\, i > 1) and (N r , r < r < r\), 
with N r ~ Po(8*h an p r ) and 

P[X ri = j] = l/{jh an p r }, j r <j< j r+1 , 

where 

jV+i-l ^ 

define P r := XlsL^Ps < 1- Then we can write T an in the form 

n-i Af r 

-^~an y ~t~ ^ ^ ^ ^ -^-rii 

r=ro i=l 

where Y is independent of the sum; the X ri represent the realizations of the copies of X 
that fall in the interval C r := [j r ,jr+i), and Y accounts for all X-values not belonging to 
one of these intervals. The idea is then to construct copies T' an and T" n of T an with T' an 
coupled to T" n + 1, by using the same N r for both, and trying to couple one pair X' ri and 
X' r \ + 1 exactly, declaring failure if this doesn't work. Clearly, such a coupling can only 
be attempted for an r for which N r > 1. Then exact coupling can be achieved between 
X' rl and X' r \ + 1 with probability 1 — l/{j r h an Pr}, since the point probabilities for X rl 
are decreasing. Noting that the p r are all of the same magnitude, it is thus advantageous 
to try to couple with r as large as possible. This strategy leads to the following theorem. 

Theorem 3.1. With Zj ~ Po(j-^*), j > 1, we have 

d TV (j?(T an ),J?(T an + l)) = 0({(a+l)/nf in- 1 ), 
ifO*^l;for 9* = 1, 

d TV (J?(T an ),J?(T an + l)) = 0({(a+l)/n} + n- 1 log{n/(a + l)}). 
Proof. We begin by defining 

B r ■= ( f| {N s = 0} j n {N r > 1}, r < r < n, 

\s=r+l / 

and setting B := (X s ^{ N s = °}- 0n the event B r, write X' r \ = X' rl - 1 if X' rl ^ j r , with 
X' r \ so distributed on the event A r := {X' rl = Jr} that its overall distribution is correct. 
All other pairs of random variables X' r/i and X" ti , (r 1 , i) G ([r , . . . , r± — 1] x N) \ {(r, 1)}, 
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are set to be equal on B r . This generates copies T' an and T" n of T an , with the property 
that T' an = T" n + 1, except on the event 

E := B U ( Q (B r n A r ) j . 

\r=ro / 

It is immediate from the construction that 

P[£ r ] = exp{-6*h an P r+1 }(l-e- e * h ^), r <r<n, 

and that P[_B ] = exp{— 0*h an P ro }; and P[A r | B r ] = l/{j r h an Pr}- This gives all the 
ingredients necessary to evaluate the probability 

n-l 

P[£] = P[B ] + ^P[S r ]P[A.|S r ]. 

r=r 

In particular, as r — )■ oo, j r ~ c r , /i a nPr ~ logc and h an P r+ i ~ ijiin) — r)logc, from 
which it follows that F[B r ] ~ c - e *( ri ( n )- r )(l - c~ r ), F[A r \ B r ] ~ l/{c r logc} and 

P[S ] ^ c - e * {ri{n) ~ ro{a)) x {(a + l)/n} e *. 

Combining this information, we arrive at 

n(n) — 1 

¥[E] x {( a + l)/n} r + ^ c~ r c~ r{ri(n) - r) . 

r=r (a) 

For 6 1 * > 1, the dominant term in the sum is that with r = ri(n) — 1, and it follows from 
the definition of ri(n) that then 

¥[E] x {(a + lj/nf+c^xKa + lj/nf+n- 1 . 

For 9* < 1, the dominant term is that with r = r (a), giving 

¥[E] x {(a + lV^+n-^a + l)^ 1 - 9 *) x {(a+l)/n} e *. 

For 8* = 1, all terms in the sum are of the same order, and we get 

F[E) x {(a + l)/n} + n- 1 log(n/(a + l)). □ 

Note that the element {(a + l)/n} appearing in the errors is very easy to interpret, 
and arises from the probability of the event that T an = 0, a value unattainable by T an + 1. 
Furthermore, the random variable T an has some point probabilities of magnitude n~ x 
[ABT, p. 91], so that n~ l is always a lower bound for the order of d TV [j£ (T an ), Jzf (T an + 1)) . 
Hence the order of approximation in Theorem 13. II is best possible if 6* ^ 1. However, for 
a = and 9* — 1, the point probabilities of T 0n are decreasing, and since their maximum 
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is of order 0(n 1 ), the logarithmic factor in the case 6* = 1 is not sharp, at least for 
a = 0. 

The method of coupling used in this section can be extended in a number of ways. For 
instance, it can be used for random variables Zj with distributions other than Poisson, 
giving the same order of error as long as d TV (JZ'(Zj), Po(9j/j)) = 0(j~ 2 ). This is because, 
first, for some K < oo, 

¥[B r ] < Kexp{-6*h an P r+1 }(l-e- 0th ^), r < r < n, 

and ¥[Bq] < K exp{—6*h an P ro }, where, in the definitions of the B r , the events {N s = 0} 
are replaced by {Zj = 0, j s < j < j s +i}- Secondly, we immediately have 

d TV (^f((Zj, j r <j< jr+l)), (Zj, j r <j< j r+ l)) = O^' 1 ), 

where the Zj ~ Po(9j/j) are independent, and hence that 

d^{T 3r ^ 3r ^)^{% r ^, r ^)) = 0(j; 1 ), 

where T rs is defined as T rs , but using the Zj. Thus, on the event B r , coupling can still be 
achieved except on an event of probability of order 0(j~ l ). 

It is also possible to extend the argument to allow for gaps between the intervals on 
which 9j > 6*. Here, for < C\ < C2, the intervals [j r , j r +\ — 1] can be replaced by intervals 
[a r , b r ], such that b r /a r > C\ and a r > kac r 2 for some k and for each 1 < r < R, say. The 
argument above then leads to a failure probability of at most 




If C®* > C2, the failure probability is thus at most of order 0(c± + l/{acf}); if of < C2, 
it is of order 0(c{ Re ). In Theorem 13.11 above, we have C\ = C2 = c, k = 1 and c~ R x 
(a + l)/n, and the results are equivalent. 

However, the method is still only useful if there are long stretches of indices j with 9j 
uniformly bounded below. This is in contrast to that discussed in the previous section, 
which is flexible enough to allow sequences 9j with many gaps. It would be interesting to 
know of other methods that could improve the error bounds obtained by these methods. 
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